(19) 



J 



Europdisches Patentamt 
European Patent Office 
Office europeen dee brevets 



(12) 



(43) Date of publicaticvi: 

06.12.2000 Bulletin 2000/49 

(21) Application number: 99U0487,B 

(22) Date of filing: 31 .05.1999 



(11) EP 1 058 392 A1 

EUROPEAN PATENT APPLICATION 

(51) \ntC\7: H03M 13/00 



(84) Designated Oonlractmg States: 


• Salant, Yoram 


AT BE CH CY DE DK £S Fl FR GB GR IE IT U LU 


RoehHaain (IL) 


MC NL PT SE 


• Yoeseff, Ronen 


Designated Extension States: 


Petach TIkva (IL) 


ALLTLVMKROSI 


• Chase, Amir 




Rannat Hasharon (IL) 


(71) Applicant: MOTOROLA, INC, 


• Kutz, Gideon 


Schaumburg, IL 60196 (US) 


Tel Avlv(lL) 


(72) Inventors: 


(74) Representative: RIchardt, Markus Albert 


• Tarrab, Moehe 


Motorola GmbH, 


Holon (IL) 


Intellectual Property Dept., 


• Rozenshein, Zvlka 


Hagenauerstrasse 47 


KfarSaba(IL) 


65203 Wiesbaden (DE) 



(54) Method for implennentlng a plurality of add-compare-select butterfly operations in parallel, 
in a data processing system 



(57) A method for performing a plurality of compare- 
select butterfly operations in a data processing system, 
the performance of the plurality of add-compare-select 
butterfly operation requires a plurality of branch mBtrics, 



a plurality of previous path metrics and plurality of pre- 
vious sun/ivor paths. While performing the plurality of 
add>compare select butterfly operations, data related to 
the previous or next add-compare select butterfly oper- 
ations are transferred. 
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Description 

Field of the Invention 

s [0001 ] Method for implementing a plurality of add-compare-select butterfly operation in parallel, in a data processing 
system, and especially in Very Long Instruction Word data processors. 

BACKGROUND OF THE INVENTION 

10 [0002] The Viterbi algorithm is used for estimation and detection problems in digital communications and signal 
processing. The following references give some examples ot various implementations of the Vrterbi Algorithm : "Im- 
plementing the Viterbi Algorithm" by Hui-Ling Lou, IEEE Signal Processing Magazine, September 1990 pg. 42; -The 
Viterbi Algorithm" by G. David Forney. Jr., Proceedings of the IEEE, vol. 61, No. 3, f^arch 1973. pg. 268. The Viterbi 
algorithm has gained wide acceptance in the area of communications, such as in cellular networks, modems, and 

IS satellite communications, lor decoding convolutional codes, and for estimation and detection problems Briefly, the 
Viterbi algorithm finds the most likely state transition in a state diagram, given a sequence of symbols. A plurality of 
states representing a given time is referred to as a stage. Each state is characterized by a path metric, and each 
transition from a state to another state Is characterized by a branch metric. The path metrics and the branch metrics 
are measures of probability. At each state the Vnerbi algorithm selects the best path by examining the accumulated 

20 metrics from all paths entering the state and selecting the path with the best metric, called a sun/ivqr path, that is the 
maximum-iikelihood path through the trellis. The best path through the trellis may be the shortest path. The other path 
or paths are discarded. 

In some implementations of the Viterbi algorithm the branch metrics are first calculated and stored in a table, and a 
plurality of add-compare-select (ACS) operations are executed to calculate the best path through the trellis for each 

zs state. An ACS operation is commonly known as an ACS butterfly, A typical ACS butterfly involves finding the best path 
from the j*th and the (i+2<^'^))th states ot a previous stage to the 2jth and the (21+1 )'th states of a current stage, wherein 
the maximum number of states within a stage is 2^. Each new state is calculated sequentially by applying the ACS 
butterfly to the previous path metric using the stored branch metrics. The ACS butterfly operation is either involved or 
folbwed by the creation of new path metrics, associated to the survivor path. In one Implementation of the Viterbi 

30 algorithm, the new path metrk; equals the sum ot the branch metric and the path metrics associated with the survivor 
path. 

[0003] The amount of time and memory required to trace a path through a trellis depends on the size of the trellis. 
Very targe trellis, are usually divided to portions, whereas each portbn is decoded separately. 
[0004] The Viterbi algorithm can be implemented in hardware or in software. When implemented in hardware, the 
35 Viterbi algorithm will generally run much faster than the Viterbi algorithm implemented in software. However, imple- 
menting the Viterbi algorithm in hardware is more expensive in terms of semiconductor surface area, complexity, and 
cost. 

[0005] tJ.S. patent 5742621 of Amon describes a method for implementing a plurality of add-compare-select butterfly 
operations in a Viterbi decoder, the Viterbi decoder was Implemented by a data processing system, and a Viterbi shift 
40 left instruction. US patent 5742621 described a Viterbi shift left instructkxi for performing a logical left shift to a trellis 
value in order to identify a survivor trellis state. 

[CK)06] A disadvantage of the method described in U.S. patent 5742621 was that it was based on a relative long 
series of steps. In view of the increasing demand for faster methods for implementing the Viterti algorithm, there is a 
need of a vary fast method for executing the add compare select butterfly operatk)ns in software. 

4S 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0007] While the invention is pointed out with particularity in the appended claims, other features of the invention are 
disctosed by the following detailed description taken in conjunction with the accompanying drawings, in which: 

so 

FIG. 1 is a block diagram of a data processing system, according to an emtpodiment of the invention; 
FIG. 2 is a diagram that illustrates registers within the core of the system of FIG. 1; 
FIG. 3 is a schematic diagram of the data register files, four arithmetic logic units and a shifter/limiter; 
FIG. 4 is a diagram that illustrates a particular embodiment of the one of the MAC units of FIG. 1; 
55 FIG. 5 is a diagram that illustrates a dispatch unit, and a dispatch operation for the core of the system of FIG. 1 ; 

FIG. 6 illustrates in flow chart form. ACS butterfly operation within a Viterbi decoder, in accordance with the present 
invention; 

FIG. 7 illustrates is a schematic form, the content of a pprtran of an internal memory module of the system of FIG. 
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1 , In accordance with the present invention; and 

FIG. 8 illustrates In flowchart form. ACS butterfly operation within a Vlteiti equalizer, in accordance with the present 

invention. 

5 DESCRIPTION OF THE PREFERRED EMBODIMENT 

[0008] Generally, the present Invention provides a parallel data structure, a mutti arithmetic logic unit system and a 
dedicated add, subtract, compare and select, and parallel Viterbi spedUc shift left instructions for minimizing the number 
of clock cycles required for performing the Viterbi algorithm. 
10 [0009] Specifically, the parallel data structure, the multi arithmetic logic unit system and the dedicated add, subtract, 
comfiare and select, and parallel Viterbi specific shift left instructions reduce the number of clock cycles in a data 
processing system that performs an ACS butterfly operation in a data processing system for choosing the best path 
through a trellis diagram. 

[(K)10] It should be noted that the particular terms and expressions employed and the particular structural and op- 
is erationai details disclosed in the detailed description and accompanying drawings are for illustrative purposes only 
and are not intended to in any way limit the scope of the invention as described in the appended claims, 
[0011] Referring to FIG. 1, an embodiment of a processing system 10 is illustrated. The processing system 10 in- 
cludes a processor core 12, a system interface unit (SlU) 14. a direct memory access unit 16, a peripheral 18, such 
as a serial communication port or timer, internal merrvDry modules 20, 22. and an external memory interface module 
20 1 9. The processing system 1 0 may also be referred to as a data processor. 

[0012] Processor core 1 2 includes address register file 26, program sequencer 24, data register files 28, 29. address 
arithmetic logic units 30 (also reierred to as address generation units (AGU)). multiply and accumulate (MAC) units 
(32) (also referred to generally as data arithmetic logic units (DALU)), AND bit field and logic unit 34. The address 
ALUs 30 are coupled to the address register file 26 via internal bus 60, The multiply and accumulate units 32 are 
2S coupled to the c^ta register files 28, 29 via mlernal bus 62, and bit field unit 34 is coupled to the data register files 28, 
29 via internal bus 64, The program sequencer 24 is coupled via the instruction bus 44 to the address ALUs 30, the 
DALUs 32. the bit field unit 34. and the instruction expansion accelerator 36. 

[0013] The system tO further includes program bus 38, first data bus 40. second data bus 42. peripheral bus 88, 
direct merriory access (DMA) bus 84, and external memory interface bus 102. Program bus 38 is coupled to program 
30 sequencer 24 via bus 46. to SlU 14 via bus 66, and to internal memory 20, 22 via buses 72 and 82 respectively Data 
buses 40. 42 are coupled to address register file 26 via buses 48. 50. to data register files 28, 29 via buses 52, 54, 
and to Instruction expansion accelerator 36 via buses 56, 58. Data buses 40. 42 are coupled to memory 20, 22 via 
buses 74-80. 

I [0014] DMA bus 84 is coupled to SlU 14 via bus 90, to DMA 16 via bus 92. to peripheral unit 18 via bus 94, and to 

ss memory units 20, 22 via buses 95 and 97 respectively. Peripheral bus 88 is coupled to the SlU 14 via bus 96. to DMA 
1 6 via bus 98, and to peripheral unit 1 8 via bus 1 00. External memory bus 1 02 is coupled to external memory interface 
module 19 and is coupled to external memory (not shown) in communication with the system 10. In the illustrated 
embodiment, the program bus 38 is 128 bits wide, and the other buses 40. 42. 64, and 86 are 64 bits wide. For con^ 
venience of explanation, the buses which are used to exchange addresses were not shown. 

40 [0Q1S] Referring to FIG. 2, a particular embodiment of registers within the core 12 of the system 10 is disclosed. As 
illustrated, the address register file 26 includes registers R0-R7, stack pointer (SP). N0-N3. M0-M2, MCTL. SA0-SA3, 
LC0-LC3. The program sequencer 24 includes the program counter, status register, and operating mode and status 
registers. The data register file 26 includes registers D0-D7 and the data register file 29 includes registers D8-D1 5. In 
an alternative embodiment, only a single register file may be used to save cost, such as with the one or two MAC 

4S conVtgurations. fn other high perform^ce applications, more than two register files may also be used. 

[(K)16] FIG. 3 is a schematic diagram of the data register files, four ALU units and shifter/limiter. It is to be understood 
that the present invention is not to be limited by the above exemplary configurations and is not limited to the particular 
number of MAC units 32 or the particular arrangements thereof. Shifter/limiter 110 is used to perform a Viterbi shift left 
operaXion on the survivor trellis value, and to insert a predetermined bit to the least significant bit of the survivor trellis 

50 value. 

[0017] Referring to FIG. 4, a particular embodiment of a MAC unit 32 is illustrated. The MAC unit 32 includes a 
multiplier 110 and an adder 112. The multiplier 110 receives data input from the data register files 28, 29, and multiplies 
the data elements 111, 114 to produce a multiplied output 116 that is input to the adder 112. The adder sums a second 
data input 1 20 and the multiplier result 1 16 to produce an accumulated result 1 22 that is output back to the data register 
55 file 28, 29. 

[0018] Referring to FIG. 5, illustrating a dispatch unit, and a dispatch operation for the core of the system of FIG. 1 . 
Internal memory 20 and 22 store instructbn fetch sets. Preferably, each instructk)n fetch set comprises of fixed number 
of instructions. An fns(ructk>n execution set is usually a subset of an instruction fetch set, usually a single instruction 
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fetch set is comprised of a single instruction execution set, but can also have instructions from other instruction exe- 
cution sets. An instruction execution set comprises of a plurality o! instructions which can be executed in parallel by 
the various execution units within system 10. 

[0019] The embodiment illustrates a dispatch unit 220. eight instruction registers 2401-2409, collectively denoted 
s 240, for storing eight instructions every clock cycle, a program memory (either program memory 20 or 22)» various 
data arithmetic togic units (DALUs) 321-324 (collectively denoted 32 in FIG. 1), address generation units (AGUs) 
301 -302. 324 (collectively denoted 30 in FIG, 1 ), and control unit 400. The dispatch unit 220 and instaictions registers 
240 may form the program sequencer 24. In the illustrated embodiment, dispatch unit 220 groups the instructions into 
execution sets, and then simultaneously dispatched them via a routing mechanism to the appropriate execution units 
10 301-302, 321-324, for parallel decoding and execution. Simultaneous dispatch means that execution of each of the 
grouped instructions is initiated during a common clock cycle. In the illustrated embodiment of the system 10. execution 
of each of the grouped instructions is initiated during a common clock cycle, but one or more of the grouped instructions 
may complete execution during a different clock cyc\e. 

[0020] System 10 uses a pipeline execution method that includes the execution stages of program prefetch, program 

IS fetch, dispatch and decode, address generation , and execute. 

[0021] FIG. 6 illustrates in flow chart form, method 300 for implementing ACS butterfly operation within a Viterbi 
decoder, in accordance with the present invention. Rectangular boxes 310, 320 and 330 represent initialization steps 
that are folbwed by rectangular boxes 340. 350 and 360. Rectangular boxes 340 and 350 represent steps which are 
followed in order to perform one iteration, or kx)p, of the ACS butterfly for a data processing system that uses the 

20 Viterbi decoder algorithm to find the best path from a previous state to a present state of a trellis. Rectangular box 360 
represents a step that is performed after the completion of the last iteration of steps 340 and 350. In one embodiment, 
the ACS butterfly is included as a DO loop within a set of instructions for decoding convolutional codes using the Viterbi 
decoder algorithm. 

[0022] The ACS butterfly performs the calculations required to select the path through the trellis that represents the 
2S rnost likely sequence that was generated by a convolutional encoder. A plurality of the ACS butterfly operations are 
repeatedly executed a predetermined number of times, or iterations, to cafoulate the best path through the trellis for 
each stage. System 10 has the ability to execute a plurality of ACS butterfly operaltons in parallel. Each new state is 
calculated sequentially by applying the ACS butterfly to the previous path metric using the stored branch meXncs. 
[0023] Note that there are several known methods for calculating the most likely sequence through the trellis. The 
30 method of the illustrated embodiment requires maximizing accumulated path metrics by choosing the path metric with 
the largest value. Other embodiments may require minimizing accumulated path metrics to choose the shortest path 
through the trellis. Some methods use the Eucledian distance as a distance measure, while other methods use the 
Hamming distance as a distance measure. 

[0024] Prior to the performance of the ACS butterfly operation, branch metrics are first calculated and stored in 
35 internal memory module 20 or 32, pointer by an effective address stored in register R1 , kx:ated in address register file 
26. The cateulation can be perfomried during initializatk)n step "START" 310. 

[0025] For convenience of explanation it is assumed that branch metrics, path metric and trellis values are stored in 
internal memory module 20. 

[0026] Each branch metric is defined as a measure of distance determined by some form of comparison between a 

40 received input signal and a corresponding branch in the trellis diagram. The term "signal" refers to information com- 
municated between an encoder of a transmitter and a decoder of a receiver of a communications system. The branch 
metrics are stored in a table, denoted as branch metric table. The whole branch metric table can be stored within the 
data register file, thus there is no need to fetch portions of the brar\ch metric table during stages 340 and 350. Preferably, 
a portton of the branch metric table is stored within the data register file. Other portions of the table metrics are fetched 

^ during either step 340 and 350. 

[0027] For convenience of explanation it is assumed that the convolutional code has a constraint length of five and 
a rate of 1/2, and that the first two ACS butterflies calculation involve the first, second, ninth and tenth states (PMOo, 
PMto, PM8o. PM9o) of a previous stage, the first, second third and fourth states (PMOn. PMIn, PM2n.PM3n) of the 
next stage, and the first and second branch metrics BMO. 6M1. The sun^tvor paths associated with the first, second, 

50 ninth and tenth states of the previous stage are denoted T0o,T1o,T8o,T9o. 

[0028] As indicated by path 31 5, step 310 is followed by step 320. During step 320 the branch metric table is moved 
into a plurality of registers located in data register files 28,29 . Preferably a portion of the branch metric table is moved 
to a plurality of registers located in the data register files 28,29. At step 320, four branch metrics BM0-BM3, which are 
used in the next four ACS butterfly operations are moved into registers 04 and 05, from a predetermined location of 

ss internal menfK>ry rTK)dule 20, that is pointed to by an effective address stored in register R1 . The effective address is 
incremented by one. 

[0029] As indicated by path 325. step 320 is foltowed by step 330. In step 330 four previous path metrics, and four 
survivor paths, which are used in the next first two ACS butterfly operatiorys are moved into registers DO, D2, D1 and 
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03, The 16 least significant bits of registers D0-D1 5 are referred to as DO.I - 01 5.1 accordingly, the 16 bits which foltow 
DO.I - D15.I are referred to as DO,h - DIS.h. accordingly. 

[0030] After step 330 is executed. DO.I stores PMOo. DO.h stores PMIo. 01 .1 stores TOo, 01 .h stores T1o, 02.1 stores 
PM8o, D2.h stores PM9o, D3.I stores T8o. D3.h stores T9o. 04,1 stores BMO. D4.h stores BM1, 06.1 stores BMO. D6.h 

s stores BMl As indicated by path 335, step 330 is followed by step 340. 

[0031] Step 340 is the first step of a DO loop for the ACS butterfly operation. At step 340. the "Add" operations for 
two ACS butterflies are performed in parallel, and preferably in the same single clock cycle. The content of 00.1 is 
added to the content of D4.I and the result is stored in 04.1. The content of DO.h is added to the content of D4.h. and 
the result is stored in D4.h. The content of D6.I is added to the content of 02. 1, and the result is stored in 06.1 The 

to content of D6.h is added to the content of D2.h, and the result is stored in D6.h. The content of D6J is subtracted from 
the content of 02.1 and the result is stored in 02.1. The content of D6.h is subtracted from to the content of D2.h and 
the result is stored in D2.h. The content of 04.1 is subtracted from the content of 00.1 and the result is stored in DO.I. 
The content of 04.h is subtracted from to the content of DO.h, and the result is stored in DO.h. 
[0032] After step 340 is executed, OOJ stores PMOo-BMO, DO.h stores PMI0-BMI, 02.1 stores PM8o-BhA0, D2.h 

IS stores PMSo-BMI , D4J stores BMO+PMOo, D4.h stores BM1 +PMI0, D6.I stores BMO+PM80. 06,h stores BMl +PM9o. 
[0033] During step 340, and in parallel to these add and subtract operation, a block of data is moved either from data 
register files 28.29 to internal memory module 20 or vice verse. The block of data which is moved to data register file 
can comprise of: a portion of the branch metric table, to be used in the next ACS butterfly operations, the portion being 
moved to the data register file; a plurality of previous path metrics and the survivor paths, to be used in the next ACS 

20 butterfly operations, moved to data register file. The block of data whk^h is moved to internal memory nxxJule 20, usually 
comprises of a plurality of the updated sun/ivor paths, and the new path metrics, associated to the survivor path metrics. 
[0034] The survivor paths are updated by using shifter/limiter 110. A surviyor path entering a state is updated after 
the ACS operation involving this stale was performed 6umg step 350, in one of the previous iteration of the DO loop. 
Each survivor path is moved to the least significant bits of shffter/ljmiter 108, which performs a logical left shift on the 

2s sun/ivor path, and a bit indicating which signal was most likely generated by the convolutional encoder is ksaded into 
the least significant bit of shifter/limiter 108. Conveniently, this bit is provided from either of flags VF0-VF3. Preferably, 
this update is done in the same ckx:k signal in which the updated survivor paths are sent to a predetermined memory 
location of internal memory module 20, whereas this location is pointed to by an effective address stored in register R4. 
[0035] As indicatisd by path 345, step 340 is followed by step 350. During step 350 the first, second, ninth and tenth 

30 previous path metrics PMOo, PMIo, PM8, PM9o are examined in view of the branch metric BMO, BM1 to determine 
the four survivor paths. Four flags. VF0-VF3 indicate which paths are the sun^ivor paths. The result in DO.h (PMI0- 
BM1) is compared to the result in D6.h (PM9o+BM1). If the fomier is greater than the latter, the content of DO.h is 
moved to DB.h, and a fourth flag VF3 is reset, else VF3 is set. The result in DO.I (PMOo-BMO) is compared to the result 
in 06.1, (BMO+PMBo), if the former is greater than the latter then the content of DO.I is moved to D6.1, and a third flag 

35 VF2 is reset, else VF2 is set. The result in D4.h (PMI0+BMI ) is compared to the result in D2.h (PM9o>BM1 ) and if the 
former is greater than the latter then the content of D4.h is moved to 02.h. and a SQCon6 flag VFI is reset, else VFl 
is set. The result in D4.I (PMOo+BMO) is compared to the result in D2J (PM80-BMO) and if the former is greater than 
the latter then the content of 04. 1 is moved to 02. 1 and a f i rst flag VFO is reset, otherwise VFO is set. Thus, flags VF0-VF3 
indicate where the survivor paths and the corresponding path metrrcs are stored. The sun/ivor paths will be updated 

40 during a further iteration of steps 340 and 350 or during step 360 in which the survivor path is updated and the updated 
survivor paths and the new path metrics are sent to internal memory module 20. Usually step 360 follows step 350 
after the ASC butterfly calculations of an entire stage is done. As long as there is there are additional ACS butterfly 
operatbns to execute, step 350 is followed by step 240, as indicated by path 355. 

[0036] In the case where a result in CX).h. OOJ, D4h. 041 is equal to a result In 06.h, 06.1, D2.h. 02.1 accordingly, 
45 there are several ways to determine the survivor path. For example, the determination may be arbitrary However, this 
determinatbn is not important for describing the invention and will not be discussed further 
[0037] FIG. 7 illustrates is a schematic form, the content of a portion 30@ 1 of internal menwry nrKxiule 20 in accord- 
ance with the present invention. 

[0038] Portbn 301 comprises of a contiguous set of memory locations vvhich is accessed as a circular buffer. Portion 
50 301 is divided to two btocks 301 @1 and 301 @2. Each block contains eight groups. Each group is comprised of four 
consecutive merriory bytes. 

[(X>39] One block stores data such as survivor paths and path metrics associated with a pervious stage while the 
other block stores data associated with the next stage. After the data of the next stage is updated, the former block 
stores data associated with a next stage while the latter block stores data associated with the previous stage. 
55 [0040] For convenience of explanation it is ass umed that first block 30 1 @ 1 stores data associated with the prevtous 
stage (PMOo - PM15o. TOo • T15o), while second block set 301 @2 stores data associated with the next stage (PMOn 
-PM15n.T0n-T15n). 

[0041] A branch metrk: table or a portion of the table are stored in a predetermined location of internal memory 
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module 20, that is pointed to by an effective address stored in register Rl . 

[0042] Resistors R2 and R3 store pointers to nr^emofy locations located in first block 301 @ 1 , these memory locations 
store the sun/ivor paths and the path metrics of a previous stage, required to perfomi two ACS butterfly operations. 
For example, R2 stores a pointer to the first group of first block 310@1 which stores PMQo, PM1o. TOo, and T1o. R3 

5 stores a pointer to the fifth group of first block 310@1 which stores Pf^9o, PM8o. T8o and T9o. 

[0043] Resistors R4 and R5 store pointers to memory locatbns located in second block 301 @2, these memory 
locations will store the treills values and the path metric of the next stage. For example, R4 stores a pointer to a set of 
two 16-bit5 memory words which store PMOn, PMIn, TOn, and Tin. R5 stores a pointer to a set of two 16-bits memory 
words which store PM2n, PM3n. T2n and T3n. 

10 [0044] After the sun/ivor paths and the path metric are moved to registers in data register tile 28, 29 , R2 and R3 are 
incremented by a predetermined constant, for pointing to two groups which contain data required for the next two ACS 
butterfly operatkjns. For example, after pointing to the first and fifth groups. R2 and R3 are incremented by 2, so that 
R2 points to the second group and R3 points to the sixth group of first block 310@ 1 . 

[0045] Registers R4 and R5 point to the first and second groups of second block 310@2. which store the survivor 
IS paths and the path metric of a next stage (PMOn. PMIn, TOn.TIn, PM2n, PM3n, T2n, T3n) after the data pointed by 
R2 and R3 undergoes two ACS butterfly operations. R4 and R5 are then incremented by a predetermined value, 
causing them to point to two groups that will contain the next results of two ASC butterfly operations. For example, 
after pointing to the first and second groups of second b\ock 310@2, R4 and FiS are incremented by 4, causing them 
to point to the third and fourth groups of second block 310® 2. 
20 [0046] Usually, a plurality of ASC butterfly operatk>ns are grouped together in order to reduce the number of initial- 
izatkxi steps required for the execution of a plurality of ACS butterfly operations and in order to utilize in a efficient 
manner the capability to transfer data relating a ACS operation while performing another ACS operation. 
[0047] An exemplary assembly code (written in capital letters), and a detailed explanation of each code line, and of 
each instruction are shown. This assembly code performs 8 ACS butterfly operations : 

25 

I kemal initialization 



// read first four branch metrics to registers D4, D5 
M0VE2L (R1)+N0,D4D5 



7 kemal 

// read path metric and survivor paths for first two ACS butterfly operations 

1 TFR D4. D6 MOVE.2L (R2)+,D0. Dl MOVE.2L (R3)+. D2 D3 

45 // ADD operation for first two ACS butterfly operation, read next four branch 

metrics to registers D 1 3, D 14. 

2 ADD2 D0,D4 SUB2 D6,D2 SUB2 D4,D0 ADD2 D2,D6 
M0VE.2L(R1H,D14D13 



// Compare and Select operations for first two ASC butterflies operation, read 

ss 

next path metrics and survivor paths. 
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MAX2VIT D4.D2 M AXlVIT D0,D6 M0VE2L (R2)+, D8 D9 

MOVE.2L(R3)+,D10D1K 

// ADD operation for third and fourth ACS butterfly operation, move survivor 
path metrics resullifig from fust two ACS butterfly operation to memory, 
update survivor paths, insert and send lo memory. 

ADD2 D8,D12 SUB2 DI4, DIO SUB2 D12, D8 ADD2 D10,D14 
VSL.4W D2,D6,D1 ,P3,(R4)+N0 VSL4F D2,D6.D I ,P3,(R5)+NO 

//Compare and Select operations for third and fourth ASC butterflies 
operation , read next path metrics and trellis values. 

MAX2VITD12,D10 MAX2V1T D8.D14 TFR D5.P6 TFR D5,D4 
MOVE.2L(R2)+,D0PI MOVE2L(R3)4.,D2D3 

// ADD operation for fifth and sixth ACS butterfly operation^ move survivor 
path metrics resulting from third and fourth ACS butterfly operation to 
memory, update survivor path and send to memory. 

ADD2D0,D4 SUB2 D6,D2 SUB2 D4»D0 ADD2D2,D6 
VSL.4W D10,DI4,D9,Dl 1,{R4)+N0 VSL.4FDI0,D14,D9.DI l,(R5)+N0 

// Compare and Select operations for the seventh and eighth ASC butterflies 
operation, r^ad next path metrics and survivor paths. 

M AX2Vrr D4,D2 M AX2Vrr D0.D6 MOVE.2L (R2)+. D8 D9 

MOVE2L(R3)+,DJOD1K 

// ADD operation for seventh and eighth ACS butterfly operation, move 
survivor path metrics resulting from fifth and sixth ACS butterfly operation 
to memory, update survivor paths and send to memory. 
ADD2 D8.D12 SUB2 D14, DJO SUB2 DI2. D8 ADD2 D10,D14 
VSL.4WD2,D6,D1,D3,(R4)+N0 VSL.4FD2.D6,D1,D3,(R5)+N0 

// Compare and Select operations for seventh and eighth ASC biitterflies 
operation, read next four branch metric values to registers D4. D5. 
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9 MAX2VITD12,D10 MAX2VIT D8,D14 M0VE.2L(R1)+, D4D5 

5 

// move survivor path metrics resulting from seventh and eighth ACS 
butterfly operation to memory, update survivor paths and send to memory 

10 VSL.4W D10,D14,D9,D1 K(R4KN0 VSL.4F DI0,D14,D9,D1 l.(R5KN0 

[0048] The instructions which formed the exemplary code perform the following operations : 
[0049] MOVE.2L (R)+NO, D1 D2 instruction moves two 16-bit words from a memory location within internal memory 
module, pointed by register R. to destination registers D1 and D2. The content of register R in incremented by NO after 
the two words are moved to registers D1 and D2, MOVE.2L is the operation code. 

[0050] R, D1, D2 and D3,D4 are registers which are located either in address register 26 file and the data register 

files 28. 29. 

[0051] TFR D1 ,D2 instruction transfers the content of a register D1 to register D2 TRF is the operation code. 
[0062] ADD2 D1 D2 adds the content of the F/2 most significant bits (i.e.- D1 .h) of F- bit register D1 to the F/2 most 

20 significant bits of register 02 (i.e.- D2.h). stores the result at D1 .h. adds the content of the F/2 feast significant bits (1. 
e.- 01. 1) of register 01 to the F/2 least significant bits of register 02 (i.e.- 02.1) and stores the result at 01.1. Carry is 
disabled between the F/2 least significant bit and the F/2 mosX significant bit of 02. A0P2 is the operation code. 
[0053] SUB2 01 D2 subtracts the content of the D2.h from D1 ,h. stores the result at 01 h, subtracts the content of 
02.1 from the content of 01.1 and stores the result at 01.1. Carry is disabled between the F/2 least significant bit and 

25 the F/2 most significant bit of 02. SUB2 is the operation code. 

[0054] MAX2VIT 01 02 performs two comparisons. It compares between the contents of 01 .1 and 02.1. and if the 
former is greater than the latter the content of the is moved to 02,1 and a flag is reset, else the flag is set. ft also 
compares between the contents of 01 .h and 02. h, and if the former is greater than the latter the content of the is moved 
to 02.h and another flag is reset, else the other flag is set. The flags indicate where the survivor paths are stored. 

30 Preferably, the registers which their content is involved in the comparison determine which flags are set/reset as a 
result of the two comparisons. For example, to1/0(2ViT 04 02 can set/reset first and second flags VFO and VF1 and 
MAX2VIT 00 06 can set/reset third and fourth flags VF2 and VF3. MAX2VIT is the operation code. 
[0055] VSL.4W 01 :D2:03:D4, (R)+N0 moves the content of 01 .1 to a first predetermined location of internal memory 
module, 20, that is pointed to by an effective address stored in register R; moves the content of 02.1 to a second 

3S predetermined location of internal memory module 20; performs a togical shift logical left on the survivor path and 
inserts a predetermined bit in the least significant bit. the location of the survivor path (either 03.1 or 04.1) is determined 
by the flags, and sends the updated suwivor paths to a third and fourth predetermined location hi memory. Conveniently, 
the distance between the first, fourth, third and fourth predetermined locations in internal memory module 20 is NO. 
The content of R is incremented by 8*N0. For example, VSL.4W 02:06:03:01 moves the content of 02,1 and 06.1 to 

40 first and second locations in internal memory module 20 and if VFO-0 then it updates the trellis value stored in 01 .1 by 
performs a logical left shift and inserting 0 to the least significant bit, and moves the updated trellis value to a prede- 
termined location in internal memory module 20. If VF0=1 it updates the content of 03.1 and moves the updated trellis 
value, wherein the least significant bit equals T, to a predetermined location in internal memory module 20. If VF2~0 
then it updates the survivor path stored in 01. 1 by performs a logical left shift and inserting 0 to the least significant bit. 

^ and moves the updated trellis value to a predetermined location in internal memory module 20. If VF2=1 it updates the 
content of 03.1 and nooves the survivor path, wherein the least significant bit equals "1". to a predetermined location 
in internal memory module 20 

[0056] VSL4H 0: 1 02:03:04, (R)+N0 is analogues to VSL4W but operates on the upper half (01 -h,02.h,03.h.04.h) 
of the registers, and also checks other flags in order to determine whether the sun/Ivor trellis values are located in 03.h 

50 or 04.h. For example, VSL.4F 02:06:03;D1 moves the content of 02.h and 06.h to two predetermined locations in 
internal memory module 20 and if VF1=0 then it updates the survivor path stored in 01 ,h by performs a logical left shift 
and inserting 0 to the least significant bit, and moves the updated survivor path to a predetermined location in internal 
memory nKxJule 20. If VF1=1 it updates the content of D3.h and moves the updated trellis value, wherein the least 
significant bit equals "1", to a predetermined location in internal memory module 20. If VF3=0 then it updates the 

55 survivor path stored in 01. h by performs a logical left shift and inserting 0 to the least significant bit, and moves the 
updated sun/ivor path to a predetermined location in Internal memory nnodule 20. It VF3=1 it updates the content of 
03.h and moves the updated suwivor path, wherein the least significant bit equals *1", to a predetermined locatbn in 
internal memory module 20 
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[0057] FIG. 8 illustrates in flow chart form, method 400 for implementing ACS butterfly operation within a Viterbi 
equalizer, in accordance with the present invention. The Viterbi equalizer detects symbols which were corrupted by 
noise. Method 400 is analogues to method 300 but involves the calculation of the branch metrics, and fetching output 
signals of a matched filter. 

s [0058] Rectangular boxes 410, 430 and 440 represent initializalion steps which are followed by rectangular boxes 
450, 460 and 470. Rectangular boxes 450 and 460 and 470 represent steps which are followed in order to perform 
one iteration, or loop, of the ACS butterfly for a data processing system that uses the Viterbi algorithm to find the best 
path from a previous state to a present state of a trellis. Rectangular box 480 represents a step which is performed 
after the completion of the last iteration of steps 450 and 460. In one embodiment, the ACS butterfly is included as a 

10 DO loop within a set of instructions for detecting symbols which were corrupted by noise, using the Viterbi decoder 
algorithm. 

[0059] The ACS butterfly performs the calculations required to select the path through the trellis that represents the 
most likely sequence that was transmitted via a communicatton channel and was corrupted by noise and by fntersymbol 
interference . A plurality of the ACS butterfly operations are repeatedly executed a predetermined number of times, or 
'5 iterations, to calculate the best path through the trellis for each stage. System 10 has the ability to execute a plurality 
of ACS butterfly operations in parallel. Each new state is calculated sequentially by applying the ACS butterfly to the 
previous path metric using the stored branch metrics. 

[0060] Each branch metric is the sum of a predetermined branch metris and the output signal of a matched filter 
Prior to the performance of the ACS butterfly operation, the predetermined branch metrics are calculated and stored 
20 in internal memory module 20 or 32. pointed by an effective address stored in register R1 , located in address register 
file 26. The calculation can be performed during initialization step "START" 410. 

[0061] The predetermined branch metrics are stored in a table, denoted as predetermined branch metric table. The 
prQ^eiermmd branch metres which are assoc\aXQ6 with the transition from a cunetxi stage to the next stage are equal 
throughout the whole trellis. Furthermore, the branch metrics associated with the upper half of the states of each stage 

26 equal the branch metrics associated to the lower half of each stage. For example, if each stage has 16 states, and 
there are 96 stages, and the predetermined branch metric which is associated to the transition from the s*th state of 
the t'th stage to the Ith state of the (t+1 )th stage is denoted as PBIVIs.l.t then PMB1 ,1,1= PMB1 ,1,2 = PMB1 .1.3 = ... 
= MB1 , 1 , 96 = PMB 1 6. 1 6. 1 = PMB1 6. 1 6,2 = PfVIBl 6. 1 6.3 . = PMBl 6. 1 6.96 = - PMB1 ,2, 1 = - PMB1 ,2,2 = -PMB1 .2,3 
= . . . = -MB1 ,2.96 = -Pr^B1 6. 1 5. 1 = -PMBl 6. 1 5,2 ^ -PIVIB1 6, 1 5.3 = ... ^ -PMBl 6.1 5.96. And if the branch metrics which 

50 is associated to the transition from the s'th state of the tlh stage to the rth state of the (1+1 yth stage is denoted as BMs, 
l,t. and the output signal of the match filter which Is associated to the transitions from the tlh stage to the (t+l yth stage 
is denoted as MFt then BMs.U=MFt+PBMs.l,t. 

[0062] The whole predetermined branch metric table can be stored within the data register file, thus there is no need 
to fetch portions of the predetermined branch metric table during stages 450-470. During step 410 the predetermined 

35 branch metric table is moved into a plurality of registers located in data register files 28,29. Preferably a portion of the 
branch metric table is moved to a plurality of registers located in the data register files 28,29. At step 410, eight pre* 
determined branch metrics BM0-BM7. which are used in the next four ACS butterfly operations are moved into registers 
D4 and 05. from a predetermined location of internal memory module 20. that is pointed to by ari effective address 
stored in register Rl . The effective address is incremented by one. 

40 [0063] As indicated by path 415, step 410 is followed by step 430, which is anatogues to step 330 of method 300. 
As m6kiaXe6 by path 435, step 430 is fol towed by step 440. 

[0064] During step 440, the "Add" operations for two ACS butterflies are performed in parallel, and preferably in the 
same single clock cycle. Four predetermined branch metrics are added to four path metrics. The "Add" operations 
which are performed during step 440 are analogues to the "Add" operations pefformed in step 340 of method 300. 

45 [0065] As indicated by path 445, step 440 is followed by step 450 which is the first step of a DO loop for the ACS 
butterfly operation. During step 450 four path metrics are examined in view of four predetermined branch metrics to 
determine the four survivor path metrics and the corresponding trellis. Four flags, VF0-VF3 indicate which are the 
survivor paths. The "compare and select" operations which are performed during step 450 are analogues to the "com- 
pare and select" operations performed in step 350 of method 300, 

so [0066] During step 450, and if there are more ACS butterfly operations to perform, output signals of a matched fitter 
and predetermined branch metrics required for the next two ACS butterfly operations are fetched from internal menriory 
module 20 to a plurality of registers of the register data file. 

[0067] As indicated by path 455, step 450 is followed by step 460 in which the survivor path metrics are updated by 
adding the output signal of the matched filter corresponding to the transition from the previous stage to the next stage 
55 to the survivor path metrics, and if there are more ACS butterfly operations to perform, the path metrics and survivor 
paths required for the next two ACS butterfly operations are fetched from interrial memory module 20 to a plurality of 
registers of the register data file. 

[0068] As indicated by paths 465 and 466, if there are more ACS operations to execute step 460 is followed by step 
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470. otherwise step 460 is followed by step 480. 

[0069] During step 480 the survivor path is updated and the updated survivor path and the sun/ivpr path metrics are 
sent to Interna! memory module 20. The update operation and sending the data to internal memory module are ana- 
logues to the update and send operations which are executed in step 360 ot method 300. 
5 [0070] Step 470 is analogues to step 470 but further includes the performance ot "add" operations for two ACS 
butterfly operations. As indicated by path 475, step 470 is followed by step 450. 

[0071] Another exemplary assembly code (written in capital tetters), is shown. This assembly code is analogues to 
the previous assembly code, but Has further instructions for calculating of the branch metrics, and fetching output 
signals of a matched filter This assembiy code performs 8 ACS butterfly operations : 
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/ kemal initialization 

//read predetermined branch metrics and matched liltcr output signals and read 
first four predetermined branch metrics to registers D4, D5 

MO YE.L (R 1 H,D7 MO VE.2L (R I HN0,D4 D5 

/ kernal 

// read path metrics and survivor path for the first two ACS butterfly operations 

M0VE.2L (R2)+,D0, Dl MOVE,2L (R3)+, D2 D3 

// ADD operation for first two ACS butterfly operation, read next four branch 
metrics to registers D13, D14. 

ADD2D0,D4 SUB2D6.D2 SUB2D4.D0 ADD2D2.D6 

// Compare and Select operations for first two ASC butterflies operation, read 
next predetermined branch metrics and matched filter output signal. 

MAX2Vrr D4,D2 MAX2VIT D0,D6 M0VE.2L (ROH, D12.D13 
M0VE.L(R1),D15 

// Calculate updated branch metrics, read path metrics and survivor paths for 
third and fourth ACS operations, 

4 TRF D13,D14 ADD2 D0,D2 SUB2 D0.D6 MOVE.2L (R2)+, 
D8,D9 MOVE.2L (R3)+, DIO Dl 1 

// ADD operation for third and fourth ACS butterfly operation, move survivor 
path metrics resulting from first two ACS butterfly operation to memory. 
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update survivor palhs, and send lo memory. 

ADD2 D8,D12 SUB2 D14, DIO SUB2 DI2, D8 ADD2 D10,D14 
VSL.4W D2.D6.DUD3,(R4)+N0 VSL.4F D2.D6,D1,D3,(R5)+N0 

// Compare and Select operations for third and fourth ASC butterflies 
operation , read next predetermined branch metrics and output signals of 
matched filter. 

MAX2Vrr DI2,DI0 MAX2V1T D8,DI4 TFR DI5.D8 
MOVE.L (R 1 )+. D7 M0VE.2L (R0)+, D4 D5 

//// Calculate updated branch metrics, read path metrics and survivor paihs for 
third and fourth ACS operations. 

TRFD5.D6 ADD2D8,D10 SUB2D8,D14 MOVE.2L(R2HN3. D0,D1 
MOVE.2L(R3)+N3,D2D3 

// ADD operation for fifth and sixth ACS butterfly operation, move survivor 
path metrics resulting from third and fourth ACS butterfly operation to 
memory, update survivor path and send to meniory. 

ADD2D0,D4 SUB2D6.D2 SUB2D4.D0 ADD2D2,D6 
VSL.4W D10,D14.D9J[)1 1,(R4)+N0 VSL.4F D10,DI4,D9,Dl I .(R5)+N0 

// Compare and Select operations for the seventh and eighth ACS butterflies 
operation, read next predetermined branch metrics and output signals of 
matched filter. 

MAX2Vrr D4.D2 MAX2VIT D0,D6 MOVEL(Rl). D15 

MOVE.2L(R0KD12D13. 

//// Calculate updated branch metrics, read path metrics and survivor paths for 
seventh and eighth ACS operations. 
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10 TRFD13,D!4 ADD2D0,D2 SIJB2D0,D6 M0VE.2L (R2H, D8,D9 
MOVE.2L(R3)4.,D10D11 

// ADD operation for seventh and eighth ACS butterfly operation, move 
survivor path metrics resulting from fifth and sixth ACS butterfly operation 
to memory, update survivor paths and send to memory. 

11 ADD2D8,D12 SUB2DI4,D10 SUB2D12,D8 ADD2D10,D14 
VSL.4W D2,D6,D1,D3,(R4)+N0 VSL.4F D2,D6,D1,D3,(R5)+N0 

// Compare and Select operations for seventh and eighth ASC butterflies 
operation. 

12 MAX2Vrr D12,DI0 MAX2VIT D8,D14 
// Calculate updated branch metrics 

13 TRFD5,D6 ADD2D8,D10 SUB2D8,DI4 

//move survivor path metrics resulting from seventh and eighth ACS butterfly 
operation to memory, update survivor paths and send to memory. 

14 VSL.4W DI0,D14,D9,D1 1,(R4)+N0 VSL.4F D10,D14,D9,D1 1,(R5)+N0 

[0072] In a further embodiment of the invention, system 10 can perform simultaneously a plurality of ACS butterfly 
operations. For example, system 10 performs each of the "ADD". "Compare and Select" operations on L K-bit words 
in a single clock cycle, and simultaneously fetch the required data to perform the ACS butterfly operations. If K^16 and 
L>4, the performance of L ACS butterflies operation requires to expand data buses 40 and 42, expand data register 
files 28,29 and to add more MAC 32 and AGU 30 units. For example, if 1=4 and K=16, than system 10* can have 8 
DALU units. 4 AGU units, and data buses 40 and 42 will be 128-bit wide. Thus, 4 branch metric required for 4 ACS 
butterflies when implementing a Viterbi decoder. Furthermore, either eight path metrics required for 4 ACS butterflies 
operations of a Viterby decoder or the 8 sun/ivor metric paths and the survivor paths resulting from the execution of 4 
ACS butterfly operations can be moved from and to the data register files and the internal memory module in a single 
clock cycle. System 10 can also have MAC units, which can perform "ADD" and "Select and compare" operations on 
J K-bit words. For example, if K=:16 and J=4. than each MAC unit can handle simultaneously four 16-bit words. The 
instruction set will be altered, and instructions ADD2, SUB2. MAX2VIT will be changed to ADD4. SUB4, MAX2VII 
which will handle each of the four 16-bits words which are sent to the MAC unit separately. For example, when per* 
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forming either of these instructions, the carry bits from the 1 5.31* and 47 bits will be disabled. 

[0073] Thus, there has been described herein an embodiment including at least one preferred embodiment of an 

improved method for implementing a plurality of ACS butterfly operation in parallel in a data processing system. It will 

be apparent to those skilled in the art that the disclosed subject matter may be modified in numerous ways and may 

assume many embodiments other than the preferred form specifically set out and described above. 

[0074] Accordingly, the above disclosed subject matter is to be considered illustrative and not restrictive, and to the 

maximum extent allowed by law, it is intended by the appended claims to cover ail such modifications and other em- 

bodiments which fall within the true spirit and scope of the present invention. The scope of the invention is to be 

determined by the broadest permissible interpretation of the following claims and their equivalents rather than the 

foregoing detailed description. 

Claims 

1. A method for performing, in parallel, a plurality of add-compare-select butterfly operations, the method comprising 
the steps of: 

loading a plurality of branch metrics Into a first storage element; 

loading a plurality of previous path metrics and a plurality of previous survivor paths into a second storage 
element; 

determining which of the plurality of the previous path nietrics are ttie sun/ivor paths using the plurality of 
branch metrics; 

performing a logical left shift on the sun/Ivor paths, inserting a plurality of logic bits at the most least significant 
bits of the survivor paths, the logical bits indicates which signals was most likely generated by a convolutlonal 
encoder and received by a Viterbi decoder; and 

storing the sun/ivor path metrics and the survivor paths in a third storage element. 

2. The method of claim 1 wherein while performing. the step of determining which of the plurality of the previous path 
metrics are the survivor paths, performing at least one of the following steps : 

(a) performing a toglcal left shift on the sunrivor paths which were generated in a previous add-compare-select 
butterfly operations, inserting a plurality of logic bits at the most least significant bits of the sun/ivor paths, the 
iogica! bits indicates which signals was nrK:>st likely generated by a convolutk>nal encoder and received by the 
VitertDl decoder and 

storing the sun/ivor path metrics which were generated in a previous add-compare-select operation in 
a third storage element; 

(Two) fetching branch metrics required to perform a next iterations of a plurality of add-compare-select butterfly 
operations, if these branch metrics were not previously fetched to the first storage element; and 
(Three) loading previous path metric and previous sun/ivor paths, required for performing a next iteration of 
L add-compare-select butterfly operatk>ns, into the second storage element. 

3. The method of claim 2. wherein the step of determining which of two previous path metres, out of the plurality of 
previous path metric, are the survivor paths comprises of the steps of : 

adding a first branch metric to a first previous path metric to obtain a first result, storing the first set of resuft 
in the first storage element, subtracting the first branch metric from the first previous path metrk; to obtain a 
second result, and storing the result in the second storage element; and 

comparing the first result and the second result, wherein the larger result is the sun/ivor path metric. 

4. The method of claim 2 wherein at least four add^jompare select operations are performed in parallel. 

5. A method for performing, in parallel, a plurality of add-compare-select butterfly operations, the method comprising 

the steps of: • 

loading a plurality of predetermined branch metrics and a plurality of output signals of a matched filter into a 
first storage element; 

loading a plurality of previous path metrics and a plurality of prevbus sun/ivor paths into a second storage 
element; 
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determining which of the plurality of the previous path metrics are the survivor paths using the plurality of 
predetermined branch metrics; wherein during the determination, producing a plurality of survivor path metrics; 
adding to the plurality of survivor path metrics a corresponding output signal of the matched filter; 
performing a logical left shift on the sun/ivor paths, inserting a plurality of logic t>its at the most least significant 
5 bits of the survivor paths, the logical bits indicates which signals was most likely corrupt by noise; and 

storing the path metrics associated to the survjvor paths and the survivor paths in a third storage element. 

6. The method of claim 5 wherein while performing either the step of determining which of the plurality of the previous 
path metrics are the survivor paths and the step of adding to the plurality of sun/ivor path metrics a corresponding 

10 output signal of the matched filter, performing at least one of the following steps : 

(a) performing a logical left shift on the survivor paths which were generated in a previous add-compare-select 
butterfly operations, inserting a plurality of logic bits at the most least significant bits of the sun^ivor paths, the 
logical bits indicates which signals was most likely corrupt by noise and 
IS storing the sun/ivor path metrics which were generated in a previous add-compare-select operation in 

a third storage element; 

(Two) loading a plurality of predetermined branch metrics and a plurality of 

output signals of a matched filter required to perform a next iterations of a plurality of add-compare-select 
butterfly operations, if these branch metrics were not previously fetched to the first storage element; and 
20 (c) ksading previous path metric and prevtous survivor paths, required for performing a next iteration of L add- 

compare-seiect butterfly operations, into the second storage element. 

7. The method of claim 6, wherein the step of determining which of two previous path metrics, out of the plurality of 
previous path metric, are the survivor paths comprises of the steps of : 

2$ 

addirig a first branch metric to a first previous path metric to obtain a first result, storing the first set of result 
in the first storage element, subtracting the first branch metric from the firsjt previous path metrk; to obtain a 
second result, and storing the result in the second storage element; and 

comparing the first result and the second result, wherein the larger result is the survivor path metric. 

30 

8. The method of claim 5 wherein at least four add-compare select operations are performed in parallel. 

$. A data processing system, in particular a digital signal processor, having means for performing a method of anyone 
or more of the preceding claims, 

55 

10. A computer program for performing a method of anyone or more of the preceding claims. 
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